Multi-speaker Recognition in Cocktail Party Problem
Abstract
This paper proposes an original statistical decision theory for multi-speaker recognition in the cocktail party problem. The theory assumes that the varying frequencies of speakers follow a Gaussian distribution and that the relationship between their voiceprints can be represented by Euclidean distance vectors. Mel-Frequency Cepstral Coefficients (MFCCs) are used to extract voice features, both to judge whether a speaker is present in a multi-speaker environment and to determine who that speaker is. Finally, a thirteen-dimensional constellation diagram is built by mapping the Manhattan distances between speakers, so that the overall influencing factors can be considered together.
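The distance-based comparison of voiceprints mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's method. It assumes each speaker is already summarized by a 13-dimensional MFCC feature vector (extraction is not shown), and the names `euclidean`, `manhattan`, `identify`, the toy vectors, and the threshold value are all hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def identify(sample, enrolled, threshold):
    """Find the enrolled speaker nearest to `sample` by Euclidean distance.

    Returns (name, distance). The name is None when even the nearest
    speaker is farther away than `threshold`, i.e. the sample is treated
    as not belonging to any enrolled speaker. The threshold is an
    assumption of this sketch, not a value taken from the paper.
    """
    name, dist = min(
        ((n, euclidean(sample, v)) for n, v in enrolled.items()),
        key=lambda pair: pair[1],
    )
    return (name if dist <= threshold else None), dist


# Toy 13-dimensional "voiceprints" (made-up numbers, not real MFCCs).
enrolled = {"alice": [0.0] * 13, "bob": [1.0] * 13}
sample = [0.1] * 13

speaker, distance = identify(sample, enrolled, threshold=1.0)
print(speaker, round(distance, 3))
print(round(manhattan(sample, enrolled["alice"]), 3))
```

In this sketch the Euclidean distance drives the decision and the Manhattan distance is only computed for comparison. How the paper combines the two, and how the Gaussian assumption enters the decision rule, is not specified in the abstract.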
Similar resources
Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter
A novel speech separation structure which simulates the cocktail party effect using a modified iterative Wiener filter and a multi-layer perceptron neural network is presented. The neural network is used as a speaker recognition system to control the iterative Wiener filter. The neural network is a modified perceptron with a hidden layer using feature data extracted from LPC cepstral analysis. The pro...
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
Speech recognition in cocktail-party environments remains a significant challenge for state-of-the-art speech recognition systems, as it is extremely difficult to extract an acoustic signal of an individual speaker from a background of overlapping speech with similar frequency and temporal characteristics. We propose the use of speaker-targeted acoustic and audio-visual models for this task. We...
Improving Source Separation via Multi-Speaker Representations
Lately there have been novel developments in deep learning towards solving the cocktail party problem. Initial results are very promising and allow for more research in the domain. One technique that has not yet been explored in the neural network approach to this task is speaker adaptation. Intuitively, information on the speakers that we are trying to separate seems fundamentally important fo...
"Eigenlips" for Robust Speech Recognition
In this study we improve the performance of a hybrid connectionist speech recognition system by incorporating visual information about the corresponding lip movements. Specifically, we investigate the benefits of adding visual features in the presence of additive noise and crosstalk (cocktail party effect). Our study extends previous experiments by using a new visual front end, and an alternative ...
Auto-associative Memory: The First Step in Solving Cocktail Party Problem
One of the most interesting and challenging problems in the area of Artificial Intelligence is solving the Cocktail Party problem. This is the task of attending to one speaker among several competing speakers and being able to switch attention from one speaker to another at any given time. The human brain is remarkably efficient in solving this problem. There have been numerous attempts to emul...
Journal title:
- CoRR
Volume abs/1712.01742
Publication date 2017